Developing Corpus-Based Translation Methods between Informal and Formal Mathematics: Project Description
نویسندگان
چکیده
The goal of this project is to (i) accumulate annotated informal/formal mathematical corpora suitable for training semi-automated translation between informal and formal mathematics by statistical machinetranslation methods, (ii) to develop such methods oriented at the formalization task, and in particular (iii) to combine such methods with learning-assisted automated reasoning that will serve as a strong semantic component. We describe these ideas, the initial set of corpora, and some initial experiments done over them.
منابع مشابه
Mathematical Methods in System and Software Engineering
Today, there is still a remarkable gap between the techniques and methods used in practice in software engineering and the formal techniques worked out and advocated by academics. Our goal is to close that gap and to bring together the pragmatic and mostly informal ideas in systems and software engineering used in practice and the mathematical techniques for the formal specification, refinement...
متن کاملSemantic Parsing of Mathematics by Context-based Learning from Aligned Corpora and Theorem Proving
We study methods for automated parsing of informal mathematical expressions into formal ones, a main prerequisite for deep computer understanding of informal mathematical texts. We propose a context-based parsing approach that combines efficient statistical learning of deep parse trees with their semantic pruning by type checking and large-theory automated theorem proving. We show that the meth...
متن کاملMining and Modeling Relations between Formal and Informal Chinese Phrases from Web Corpora
We present a novel method for discovering and modeling the relationship between informal Chinese expressions (including colloquialisms and instant-messaging slang) and their formal equivalents. Specifically, we proposed a bootstrapping procedure to identify a list of candidate informal phrases in web corpora. Given an informal phrase, we retrieve contextual instances from the web using a search...
متن کاملتخمین ارزش اقتصاد غیررسمی در ایران برمبنای متغیّرهای زیستمحیطی، رهیافت فیلترکالمن
Informal economy i.e. unrecorded economy, is one of the important problems in developing countries which affects the efficiency of economic activities in formal sector. Informal economy is also an important source of air pollution. This paper aims at estimating informal economy in Iran over the period 1980-2009 based on the mount of CO2 emissions and the country forest areas and using Kalman ...
متن کاملCross-Language Pseudo-Relevance Feedback Techniques for Informal Text
Previous work has shown that pseudo relevance feedback (PRF) can be effective for cross-lingual information retrieval (CLIR). This research was primarily based on corpora such as news articles that are written using relatively formal language. In this paper, we revisit the problem of CLIR with a focus on the problems that arise with informal text, such as blogs and forums. To address the proble...
متن کامل